Masked Autoencoder (MAE) is a notable method for self-supervised pretraining in visual representation learning. It operates by randomly masking image patches and reconstructing these masked patches using the unmasked ones. A key limitation of MAE lies in its disregard for the varying informativeness of different patches, as it uniformly selects patches to mask. To overcome this, some approaches propose masking based on patch informativeness. However, these methods often do not consider the specific requirements of downstream tasks, potentially leading to suboptimal representations for these tasks. In response, we introduce the Multi-level Optimized Mask Autoencoder (MLO-MAE), a novel framework that leverages end-to-end feedback from downstream tasks to learn an optimal masking strategy during pretraining. Our experimental findings highlight MLO-MAE's significant advancements in visual representation learning. Compared to existing methods, it demonstrates remarkable improvements across diverse datasets and tasks, showcasing its adaptability and efficiency. Our code is available at https://github.com/Alexiland/MLO-MAE.
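To make the contrast between MAE's uniform masking and an informativeness-aware strategy concrete, the following is a minimal PyTorch sketch. It is not the authors' MLO-MAE code: the function names, the 75% mask ratio, and the externally supplied patch_scores tensor (standing in for whatever a learned, downstream-guided scorer would produce) are illustrative assumptions.

```python
import torch

def random_patch_mask(patches, mask_ratio=0.75):
    """Vanilla-MAE-style masking: every patch is equally likely to be
    hidden, regardless of how informative it is."""
    B, N, D = patches.shape                              # batch, patches, embed dim
    num_keep = int(N * (1 - mask_ratio))
    scores = torch.rand(B, N, device=patches.device)     # uniform scores
    keep_idx = scores.argsort(dim=1)[:, :num_keep]       # random subset to keep
    visible = torch.gather(patches, 1, keep_idx.unsqueeze(-1).expand(-1, -1, D))
    return visible, keep_idx                             # encoder sees only `visible`

def scored_patch_mask(patches, patch_scores, mask_ratio=0.75):
    """Hypothetical informativeness-driven variant: keep the patches that a
    learned scorer rates highest; `patch_scores` (shape [B, N]) is assumed
    to come from a module trained with downstream feedback."""
    B, N, D = patches.shape
    num_keep = int(N * (1 - mask_ratio))
    keep_idx = patch_scores.argsort(dim=1, descending=True)[:, :num_keep]
    visible = torch.gather(patches, 1, keep_idx.unsqueeze(-1).expand(-1, -1, D))
    return visible, keep_idx
```

In both cases the decoder is trained to reconstruct the hidden patches; the two functions differ only in how the kept subset is chosen, which is the design axis the abstract describes.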
-
Real-world applications often involve irregular time series, for which the time intervals between successive observations are non-uniform. Irregularity across multiple features in a multivariate time series further results in a different subset of features being observed at any given time (i.e., asynchronicity). Existing pre-training schemes for time series, however, often assume regularity and make no special treatment of irregularity. We argue that such irregularity offers insight into domain properties of the data (for example, the frequency of hospital visits may signal a patient's health condition) that can guide representation learning. In this work, we propose PrimeNet to learn a self-supervised representation for irregular multivariate time series. Specifically, we design a time-sensitive contrastive learning and data reconstruction task to pre-train a model. Irregular time series exhibit considerable variation in sampling density over time. Hence, our triplet generation strategy follows the density of the original data points, preserving their native irregularity. Moreover, the variation in sampling density over time makes data reconstruction difficult in different regions. Therefore, we design a data masking technique that always masks a constant time duration to accommodate reconstruction across regions of different sampling density. We learn with these tasks on unlabeled data to build a pre-trained model and fine-tune on downstream tasks with limited labeled data, in contrast with existing fully supervised approaches for irregular time series, which require large amounts of labeled data. Experimental results show that PrimeNet significantly outperforms state-of-the-art methods on naturally irregular and asynchronous data from healthcare and IoT applications for several downstream tasks, including classification, interpolation, and regression.
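As a rough illustration of the fixed-duration masking idea (this is not PrimeNet's released implementation; the function name, the default window length, and the use of NaN as the mask placeholder are assumptions), the key point is that the mask covers a constant span of time rather than a constant number of observations, so densely sampled regions lose many points while sparse regions lose few:

```python
import numpy as np

def mask_fixed_duration(timestamps, values, window=1.0, rng=None):
    """Hide every observation inside one randomly placed window of fixed
    time duration (not a fixed count of points)."""
    rng = rng or np.random.default_rng()
    window = min(window, timestamps.max() - timestamps.min())
    start = rng.uniform(timestamps.min(), timestamps.max() - window)
    hidden = (timestamps >= start) & (timestamps < start + window)
    corrupted = values.astype(float)      # astype copies, original stays intact
    corrupted[hidden] = np.nan            # reconstruction target is values[hidden]
    return corrupted, hidden
```

Reconstructing values[hidden] from the remaining observations is then roughly equally feasible whether the window falls in a dense or a sparse region, which is the property the constant-duration mask described in the abstract is meant to provide.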